首页> 外文OA文献 >A Hardware-Software Integrated Solution for Improved Single-Instruction Multi-Thread Processor Efficiency
【2h】

A Hardware-Software Integrated Solution for Improved Single-Instruction Multi-Thread Processor Efficiency

机译:硬件-软件集成解决方案,可提高单指令多线程处理器效率

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This thesis proposes using an integrated hardware-software solution for improving Single-Instruction Multiple-Thread branching efficiency. Unlike current SIMT hardware branching architectures, this hardware-software solution allows programmers the ability to fine tune branching behavior for their application or allow the compiler to implement a generic software solution. To support a wide range of SIMT applications with different control flow properties, three branching methods are implemented in hardware with configurable software instructions. The three branching methods are the contemporary Immediate Post-Dominator Re-convergence that is currently implemented in SIMT processors, a proposed Hyper-threaded SIMT processor for maintaining statically allocated thread warps and a proposed Dynamic Micro-Kernels that modified thread warps during run-time execution. Each of the implemented branching methods have their strengths and weaknesses and result in different performance improvements depending on the application. SIMT hyper-threading turns a single SIMT processor core into multiple virtual processors. These virtual processors run divergent control flow paths in parallel with threads from the same warp. Controlling how the virtual processor cores are created is done using a per-warp stack that is managed through software instructions. Dynamic Micro-Kernels create new threads at run-time to execute divergent control flow paths instead of using branching instructions. A spawn instruction is used to create threads at run-time and once created are placed into new warps with similar threads following the same control flow path.This thesis\u27s integrated hardware-software branching architectures are evaluated using multiple realistic benchmarks with varying control flow divergence. Synthetic benchmarks are also used for evaluation and are designed to test specific branching conditions and isolate common branching behaviors. Each of the hardware implemented branching solutions are tested in isolation using different software algorithms. Results show improved performance for divergent applications and using different software algorithms will affect performance.
机译:本文提出使用一种集成的软硬件解决方案来提高单指令多线程分支效率。与当前的SIMT硬件分支体系结构不同,此硬件软件解决方案使程序员能够为自己的应用程序微调分支行为,或者允许编译器实现通用软件解决方案。为了支持具有不同控制流属性的各种SIMT应用程序,在具有可配置软件指令的硬件中实现了三种分支方法。这三种分支方法是当前在SIMT处理器中实现的当代即时后统治者重新收敛,提议的用于维护静态分配线程扭曲的超线程SIMT处理器以及提议的在运行时修改线程扭曲的动态微内核执行。每种已实现的分支方法都有其优点和缺点,并且会根据应用程序而导致不同的性能改进。 SIMT超线程将单个SIMT处理器内核转变为多个虚拟处理器。这些虚拟处理器与来自同一经线的线程并行运行不同的控制流路径。使用通过软件指令管理的每个线程堆栈来控制如何创建虚拟处理器内核。动态微内核在运行时创建新线程以执行不同的控制流路径,而不使用分支指令。一条spawn指令用于在运行时创建线程,一旦创建,它们就会按照相同的控制流路径被放入具有类似线程的新线程束中。本文使用具有不同控制流的多个实际基准对集成的硬件-软件分支架构进行了评估。分歧。合成基准还用于评估,旨在测试特定的分支条件并隔离常见的分支行为。每个硬件实现的分支解决方案都使用不同的软件算法进行了隔离测试。结果表明,针对不同应用程序的性能有所提高,并且使用不同的软件算法会影响性能。

著录项

  • 作者

    Steffen, Michael;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号